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ABSTRACT 

We report on an attempt to accurately wavelength calibrate four nights of data taken with 
the Keck HIRES spectrograph on QSO PHL957, for the purpose of determining whether the fine 
structure constant was different in the past. Using new software and techniques, we measured 
the redshifts of various Ni II, Fe II, Si II, etc. lines in a damped Lya system at z = 2.309. 
Roughly half the data were taken through the Keck iodine cell which contains thousands of well 
calibrated iodine lines. Using these iodine exposures to calibrate the normal Th-Ar Keck data 
pipeline output we found absolute wavelength offsets of 500 m/s to 1000 m/s with drifts of more 
than 500 m/s over a single night, and drifts of nearly 2000 m/s over several nights. These offsets 
correspond to an absolute redshift of uncertainty of about Az w 10~ 5 (AA « 0.02A), with daily 
drifts of around Az « 5 x 10~ 6 (AA « O.OlA), and multiday drifts of nearly Az « 2 x 10~ 5 
(» 0.04A). The causes of the wavelength offsets arc not known, but since claimed shifts in the 
fine structure constant would result in velocity shifts of less than 100 m/s, this level of systematic 
uncertainty may make it difficult to use Keck HIRES data to constrain the change in the fine 
structure constant. Using our calibrated data, we applied both our own fitting software and 
standard fitting software to measure — , but discovered that we could obtain results ranging 
from significant detection of either sign, to strong null limits, depending upon which sets of 
lines and which fitting method was used. We thus speculate that the discrepant results on 
— reported in the literature may be due to random fluctuations coming from under-estimated 
systematic errors in wavelength calibration and fitting procedure. 
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1. Introduction: 
Constant 



Varying Fine Structure 



The fine structure constant today, ao = 
1/137.03599911, is usually thought of as a fun- 
damental, unchanging, constant of nature, but 
recently both experimental and theoretical pa- 
pers have challenged this assumption, (e.g., see 
the review by Garcia-Berro, Isern, and Kubyshin 
(2007)). Motivated especially by the possible ex- 
perimental detection (see below) of a change in 
a, we applied for and took four nights of Keck 
data, half of it through the Keck iodine cell, on 
QSO PHL957. Our goal was to get an extremely 
well-calibrated, high signal/noise absorption spec- 
trum on a distant damped Lya (DLA) system 
(z = 2.309) in order to measure the value of the 



1 



fine structure constant more than 10 billion years 
ago and compare it to the value today. 

The basic method to determine — = (a z — 
q.q)/q.q, is to measure differences in redshifts be- 
tween different atomic transitions of elements in 
the same physical system, and use the fact that 
for small changes in a the energy level of a given 
atomic transition can be approximated as 

Aa , . 

uj a =uj + 2q , (1) 

a 

where luq = 1/Ao is the frequency of the transition 
on Earth, Lu a is the frequency in the high redshift 
cloud, and the q- values measure the dependence of 
u> a on a and have been calculated for many com- 
mon transitions, (see for example, Murphy et al. 
2001a, Dzuba, et al. 2002, and Porsev, et al. 2007 
for more detailed discussion and Table [2] for the 
values of q for various transitions.) The values of q 
depend upon the electron orbital configurations of 
the initial and final quantum states, and therefore 
different transitions have different values of q. If 
all transitions had the same q, then all wavelength 
shifts would be the same and one could absorb any 
change in a into the determination of the redshift 
of the physical system. However, since different 
transitions have different values of q, the relative 
transition wavelengths will differ from what they 
are in the lab if ^ ^ 0. 

For example, Murphy, et al. (2001a; 2001b; 
2003; 2004), used a many-multiplet method on 
Keck HIRES data of 143 absorption systems in the 
redshift range 0.2 < z < 3.7 to find a significant 
reduction of a in the past, — = (—5.43 ± 1.16) x 
10~ 6 , while Chand, et al. "(2004) and Srianand 
et al (2004), used the same method for a subset 
of transitions on VLT/UVES data on 23 absorbers 
to find ^ = (-0.6±0.6) x 10~ 6 . This latter anal- 
ysis was criticized by Murphy, et al. (2008) who 
reanalyzed the Chand et al. data to get — = 
(—4.4 ±3.6) x 10 -6 , consistent with their previous 
result. In the meantime other groups returned re- 
sults, for example, Levshakov, et al. (2006) used 
the VLT/UVES spectrograph to study Fe II lines 
in one system at z — 1.15 and found a null result, 
= (-.07±0.84) x 10~ 6 , and in another system 

at z = 1.8 to find = ( 5 - 4 ± 2 - 5 ) x 10 ~ 6 ( Lev " 
shakov, et al. 2007). Murphy, et al. (2006) have 
also criticized these results, claiming that the data 
do not allow limits as strong as those reported. 



Subsequently, the Levshakov, et al (2006) results 
were weakened to = (-0.12 ± 1.8) x 10~ 6 for 
the z = 1.15 system and = (5.7 ± 2.7) x 10~ 6 
for the z = 1.8 system (Molaro, et al. 2008), still 
a null result inconsistent with the detections. 

Given the above inconsistent results, we were 
particularly interested in the Fe II A1608 and Fe II 
A1611 transitions since these have q values that are 
both large and more importantly of opposite sign. 
Thus, if a was different in the past, the relative 
positions of these two lines should be significantly 
shifted from their laboratory values. For our DLA 
at z = 2.309, a relative shift between Fe II A 1608 
and Fe II A1611 of about — 136±21 m/s is expected 
if the Murphy et al. value = (™5.43±0.116) x 
10~ 6 is correct. (Fe II A1608 shifts by -54 ± 12 
m/s, while Fe II A1611 shifts by 82±18 m/s in the 
rest frame.) Thus our goal was to centroid these 
lines to better than 50 m/s, so as to determine — 
in a single ion in a single absorption system. 

Our method, which is close to that used by Lev- 
shakov, et al 2006, contrasts with that of Murphy 
et al. (2001a; 2001b; 2003; 2004) and Chand et 
al. (2004) where the signal/noise was not high 
enough to detect the — signal in any single pair 
of lines; they did a statistical averaging over many 
transitions in many absorption systems, and thus 
might be subject to systematic errors in selection, 
calibration, or averaging procedures. Since we ex- 
pected to have superbly well-calibrated spectra, 
and these two Fe II lines appear in the same echelle 
order we hoped we could give convincing evidence 
for or against a change in the fine structure con- 
stant. 

For our work, in addition to the Fe II lines, 
there are also several Ni II, Si II, Al II, and Al 
III lines that fall in the wavelength range cov- 
ered by the iodine cell and that we can use. 
In what follows, besides the Fe II A1608/A1611 
pair, various other sets of lines are used. Poten- 
tially we could fit all 16 lines that have calcu- 
lated q values and that appear in our spectra: Fe 
II A1608/A1611/A2344, Ni II A1709/A1741/A1751, 
Si II A1526/A1808, Al III A1854/A1862, Al II 
A1670, Cr II A 2062/A2056/A2066, and Zn II 
A2026/A2062. If we restrict ourselves to lines 
that occur at wavelengths for which we have io- 
dine spectra we would use only the 9 lines: Fe 
II A1608/A 1611, Nill A1709/A1741/A1751, Si II 
A1808, Al III A1854/A1862, and Al II A1670. If we 
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worry that saturated lines (those with minimum 
flux less than 10%) may not be accurately fit we 
can restrict ourselves to the 7 lines that meet the 
above criteria and are unsaturated: Fell A1611, 
Nill A1709/A1741/A1751, Sill A1808, and Al III 
A1854/A1862. Finally, we note that Al III has a 
systematically higher ionization potential than the 
other ions and is a sub-dominant ionization state 
of Al. It thus could exist in a physically differ- 
ent location. Thus, we most reliably consider the 
five calibratable, unsaturated, singly ionized tran- 
sitions: Fell A1611, Nill A1709/A1741/A1751, and 
Sill A1808. 

2. Data and Extraction 

PHL957 is a bright (B=16.6) QSO at z = 2.7, 
with a damped Ly alpha system at z = 2.309 
(Beaver, et al 1972). We obtained 5.5 hours of 
data on November 1, 2002, 4 hours on Oct 3, 2004, 
4 hours on Oct 4, 2004, and 5 hours on Oct 5, 2004. 
Five of the 11 exposures taken in 2002 had the io- 
dine cell in place, while 6 of the 13 exposures taken 
in 2004 were taken through the iodine cell. Ta- 
ble [1] shows how the iodine cell exposures were in- 
terspersed with the non-iodine cell exposures. Ta- 
ble [1] also shows the times that the relevant Th/Ar 
calibration arcs were taken, as well the temper- 
atures inside the HIRES enclosure, and whether 
or not the cchcllc gratings were moved between 
the the QSO exposures and the relevant Th/Ar 
exposures. The data from 2002 were acquired 
through the CI decker (FWHM w Gkms" 1 ), with 
the kv380 blocking filter in place. The data from 
2004 were also observed through the CI decker but 
with the kv418 blocking filter, and the CCD mo- 
saic was binned by two in the spatial dimension. 

The 2002 spectra were extracted and combined 
using MAKEE (Barlow 2002). The 2004 spec- 
tra were extracted and combined using the XIDL__ 
package HIRedusH (Bernstein, et al. in prepara- 
tion). The 2002 and 2004 data cannot be easily 
combined since a new mosaic of CCD's was in- 
stalled in the spectrograph in between these runs. 
We note that the 2002 data have a substantially 
worse signal/noise per pixel (42 for the non- iodine 
cell exposures) compared to the combined 2004 



1 http : / / www. ucolick . org/ ~xavier /IDL 

2 http: / /www. ucolick.org/~xavier/HIRedux/HIRES_doc. html 



non-iodine cell data which have a S /N of about 70 
per pixel. 

2.1. Iodine Cell 

The Keck iodine cell has been used exten- 
sively in searches for extra solar planets using the 
Doppler technique (Butler, et al. 1996). The cell 
is placed in the beam and superposes several nar- 
row absorption lines per Angstrom on the QSO 
spectrum between 4950A and 5900A. As pointed 
out by Murphy, et al. (2001b), the different opti- 
cal paths of the Th-Ar lamp and the QSO spec- 
trum are a possible source of systematic error in 
the wavelength calibration. Using the iodine cell, 
wavelength calibration errors can be dramatically 
reduced since issues such as atmospheric disper- 
sion, guiding errors at the slit, and all changes to 
the optics of the spectrograph are shared by both 
the iodine lines as well as the QSO spectra. In 
addition, the Th-Ar lamp spectra are done at dif- 
ferent times than the QSO spectra. 

Figure [1] shows a sample of the iodine cell spec- 
tra taken with the Fourier Transform Spectrome- 
ter (FTS) at KPNO with a resolution of around 
170,000 and a signal/noise of 700 per pixel (But- 
ler, et al., 1996; Marcy, 2008). This high reso- 
lution and S/N means that we do not expect the 
iodine spectrum to be a significant source of wave- 
length calibration error. Figure [5] shows a portion 
of echelle order 67 taken both with and without 
the iodine cell. In the bottom panel of Figure [2] 
one can see the iodine lines as well as the Fe II 
A1608 and Fe II A1611 lines of the DLA towards 
PHL957. The top panel of Figure does not con- 
tain the iodine lines. 

2.2. Wavelength Calibration 

In order to do the wavelength calibration we 
started from the XIDL Th-Ar calibration. E XIDL 
takes Th-Ar wavelengths that have been trans- 
formed from vacuum values to air values so that 
lines can be identified. Using these identified lines 
a polynomial is fit over the full 2D spectrum. The 
wavelength scale found is then transformed back 
to vacuum values using the inverse Edlen formula. 



3 As a check we also extracted spectra using MAKEE, both 
with and without the sky line wavelength calibration op- 
tion. In both cases iodine re-calibration of about the same 
magnitude as with XIDL was needed. 
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Fig. 1— Three Angstroms of the KPNO FTS 
spectrometer iodine cell spectrum. 
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Fig. 2. — Combined 2004 absorption spectra 
showing a portion of order 67 containing the Fe 
II A 1608 and Fell A 16 11 lines. The top panel 
shows the spectrum taken without the iodine cell 
in place, and the bottom panel shows the spectrum 
taken through the iodine cell. 



After finding the Th-Ar wavelength scale we 
use our iodine measurements with two similar 
but independent methods to recalibrate. In both 
methods we convolved the high signal/noise iodine 
spectrum measured at KNPO (Butler, et al. 1996) 
with a Gaussian and then minimized the \ 2 of the 
difference between this convolved spectrum and 
the PHL957 spectrum shifted by an amount AA ca i. 

In the first method a 3-parameter fit was per- 
formed in each wavelength bin of the PHL957 
spectra, with bin sizes from 1 to 10 Angstroms 
being used. The fit returned the wavelength shift, 
AA C ai, the sigma of the convolution Gaussian, 
as well as a multiplicative continuum offset and 
the formal errors of these quantities. Using this 
method, regions of the PHL957 spectra that had 
strong lines were not included in the fit since these 
could distort the results; a linear interpolation of 
bins on cither side of the line region was used to 
find a correction at the line center. 

Using the second method, strong lines in the 
PHL957 spectra were fit and subtracted from the 
data before differencing with the convolved iodine 
spectra. This allowed direct determination of a 
wavelength correction even at line centers (except 
for saturated regions which are removed as above) . 
Also, in the second method rather than fitting 
for both AA ca i and the convolution sigma in each 
wavelength bin, a rolling 5 to 10 Angstrom bin 
iterative method was used that found the wave- 
length correction for each data point and also the 
one best convolution sigma for the entire echcllc 
order. We also tried setting the sigma of our 
Gaussian convolution kernal to the resolution ex- 
pected from the physical setup of the telescope, 
R = 50, 000. This sigma was consistent with but 
slightly smaller than the sigma we found by fit- 
ting, but the wavelength offsets found either way 
agreed to within errors. In this method the errors 
in the wavelength calibration were found from the 
values of AA ca i that caused the x 2 to increase by 
1. 

The methods of finding the continuum of the 
FTS iodine spectra were also somewhat different 
for the two methods. In the first method, we used 
the highest flux value in a variable size wavelength 
bin as the continuum value. In the second method, 
we averaged the three largest flux values (and their 
corresponding wavelengths) in each one Angstrom 
bin, and then set the continuum by fitting a spline 
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through these averaged points. 

Determining the continua of the PHL957 spec- 
tra taken with the iodine cell is difficult because of 
the large number of unresolved overlapping iodine 
lines. The first method used a 4th order polyno- 
mial fit to the set flux values within 2-sigma of the 
highest flux found over a certain wavelength range. 
The second method found the PHL957 continuum 
using a standard continuum fitting program, thus 
probably underestimating it. Because of this un- 
derestimate, we repeated our analysis using sev- 
eral different possible continua and discovered that 
our final wavelength calibration results were ro- 
bust for all plausible PHL957 continua. 

A comparison between the results of the two 
methods showed agreement in AA ca i within errors, 
and also good agreement in the reported errors on 
AAcai- We also tried a method that used a cross- 
correlation between the FTS and PHL957 spectra, 
which also worked, but did not seem as accurate 
as either of the other two methods. 

We will use the results of our second method 
throughout the rest of the paper. Our final recali- 
bration shift data is smoothed with a 5- Angstrom 
box filter to reduce fitting noise. 

Figure [3] compares a small portion of echcllc 
order 67 of PHL957 with the convolved and shifted 
FTS iodine cell spectrum. 

Note that the Th-Ar calibrated spectra used as 
input to our recalibration pipeline should not in- 
clude the standard Keck HIRES heliocentric cor- 
rection that accounts for the Doppler shift from 
the changing motion of the Earth around the Sun. 
We recalibrate in the lab frame, and then make our 
own, more accurate, solar system barycentric mo- 
tion correction using the code developed by Marcy 
and Butler (2008). This barycentric correction 
should be good to better than 1 m/s. 

We found that the accuracy of the wavelength 
recalibration as measured by our calculated error 
depended strongly on the size of the bin used for 
the convolution. Thus we report values from a 
10A bin analysis as a compromise between small 
uncertainty and constancy of the wavelength cali- 
bration over the width of a single bin. 

Figure [4] shows an example (exposure 3-1) of 
the resultant wavelength recalibation shift over the 
entire wavelength range for which we have both 
QSO and FTS iodine spectra. This figure is a 



major result of this work. 

We see several interesting effects. First the size 
of the shift is significant: typically from 500 m/s 
to 1000 m/s corresponding to a substantial frac- 
tion of a CCD pixel (roughly 1300 m/s/pixel or 
0.023A/pixel at A w 5300A.) Contrast these sys- 
tematic velocity errors with an expected shift of 
136 m/s between Fe II A1608 and Fe II A1611 
due to the — claim described in the introduction. 

a 

We note previous workers have found similar Keck 
HIRES shifts of lOOOm/s or larger (e.g. page 734 
of the night airglow line paper by Osterbrock et 
al. 2000), and Figure 4 of Suzuki et al. 2003) but 
this does not seem to be widely appreciated. 

Second, there is a clear pattern seen in each 
echelle order, with the shift increasing from the 
edge of each order and reaching a maximum near 
the middle. For the purpose of measuring As it is 
not the overall shift discussed above that is impor- 
tant, but the relative shift between the transitions 
being compared. Depending on the echelle order 
Figure |4] shows relative shifts of 300 m/s to 800 
m/s within the same order. These shifts could be 
dangerous since, depending upon the lines being 
being compared, they could result in a systematic 
relative velocity shift between lines, thereby mock- 
ing a changing a. 

Next, we are interested in how this wavelength 
recalibration shift varies with time. Thus, Fig- 
ure [5] shows a more detailed look at echelle or- 
der 67 which contains the Fe II A1608/A1611 lines. 
In the figure, different lines are the recalibration 
shifts for each of the five exposures that used the 
iodine cell and are labeled by their ID's, e.g. 3-0 is 
the first exposure of PHL957 taken on Oct-3-2004. 
See Tabled] for more details. 

We note that each exposure has a similar, but 
not identical shape as a function of wavelength, 
and that there is a different wavelength offset for 
each exposure. While the variation over this order 
for each exposure is typically 500 m/s or less, the 
shift between different exposures can be as much 
as 2000 m/s. This means that the Th-Ar wave- 
length calibration that gives us the wavelength 
solution as a function of pixel, is not stable and 
drifts with time. 

It is of interest to note that recalibration offsets 
between nights are in general larger than the drifts 
during each night. One sees that on Oct 3 and 
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Fig. 3. — A small portion of echcllc order 67 of 
PHL957 (thick line) compared with the convolved 
and shifted FTS iodine cell spectrum (thin dashed 
line) . 
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Fig. 5. — Time evolution of iodine cell wavelength 
recalibration shift for echelle order 67. Each line is 
labeled by the day-observation number, with the 
solid line being the exposure taken earlier in the 
night. 



Fig. 4. — Wavelength correction over entire wave- 
length range for exposure 3-1 of PHL957. The 
shift is between the standard Th-Ar wavelength 
calibration and the wavelength scale found by fit- 
ting to the FTS iodine spectrum. The echelle or- 
der is marked. 
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4, when the iodine cell exposures where taken one 
after the other, the shift during the night was sub- 
stantially smaller than the inter-day shifts, while 
on Oct 5, when there was three hours between io- 
dine cell exposures, there was a larger shift. 

Figure [5] is probably more important than the 
previous figure, since it shows that the Th-Ar cal- 
ibration is not stable over time. If the wavelength 
shifts shown in Figure 3] were stable in time, they 
could be removed and would have little effect on 
the measurement of — . But large systematic 
wavelength shifts during the night mean that mea- 
surements at the desired level of precision may be 
difficult with Th-Ar calibration alone. 

It is important to note that even though the 
calibration errors reported here are much larger 
than the final velocity precision needed to deter- 
mine ^ , it is possible that these calibration errors 
average out and do not ruin the final — deter- 
mination. In the many multiplet method, many 
different lines are compared across many different 
absorbers at many different rcdshifts. If the signs 
of the calibration errors are random, the calibra- 
tion errors may average away. A complete discus- 
sion of this possibility is beyond the scope of this 
work, but will be presented elsewhere. 

2.3. Understanding the Calibration Shifts 

We made some preliminary attempts at un- 
derstanding the causes of these unexpected wave- 
length calibration shifts. While complete under- 
standing of the causes and exploration of methods 
to correct for the shifts are beyond the scope of 
this paper, we report some ideas and preliminary 
results here. We hope to use additional data and 
analysis to find a more complete understanding, 
which will then be published elsewhere. 

First, it is interesting that during all three 
nights the shifts became more negative at about 
500 m/s per hour. Thus we plotted the shifts vs 
time, and also vs. the various temperatures, etc. 
that HIRES reports. With only six iodine expo- 
sures it is difficult to discern a pattern and impos- 
sible to that prove a pattern exists, and in fact we 
did not see any fully convincing trends. 

The best partial trend is a decrease in cali- 
bration shift with ATempin, where ATempin is 
the difference in HIRES enclosure temperature be- 
tween when the Th/Ar calibration exposure was 



taken and when the QSO exposure was taken. We 
plot this trend in Figure where the 6 QSO ex- 
posures are shown as asterisks and individually 
labeled. The data is given in Table Q] We see that 
the trend is badly broken by the 4th exposure on 
Oct 5 (5-3 or ID 2097), which originally led us to 
discount temperature as the main culprit. How- 
ever, we have some exposures of a star (HD209833) 
taken the same nights through the iodine cell, and 
we plot the calibration shifts vs ATempin for them 
as small boxes. The complete analysis of these star 
exposures is beyond the scope of this paper, but 
we see that they fall on the same trend line, show- 
ing that exposure 5-3 seems to be an exception. 

Next, we check whether or not the echelle grat- 
ings were moved between the time of the Th/Ar 
arc exposure and the QSO exposure. This is 
shown in TableQ] We see that only on Oct 3 (expo- 
sures 3-1 and 3-2) were the gratings not moved be- 
tween Th/Ar calibration and the QSO exposures. 
In Figure [6] we see that the dispersion of exposures 
3-1 and 3-2 from the trend line is smaller than for 
exposures 4-1 and 4-2. Exposure 5-0 is near the 
trend line, but of course 5-3 is way off, probably 
for some other reason. Thus we see a weak hint 
that moving the gratings between Th/Ar calibra- 
tion and science exposure can cause a calibration 
error. 

To understand the problem with exposure 5-3 
we note that a different Th/Ar arc was used for 5-3 
than for the other iodine exposure (5-0) taken that 
night. To check whether something went wrong 
with the arc exposure (ID=2107) used to calibrate 
5-3, we redid the iodine recalibration using arc ex- 
posure 2026, taken much earlier that same night. 
This point is also plotted in Figure [5] as a labeled 
triangle. The point shifts substantially, but keeps 
the same (large) distance from the trend line. 

Thus we see that while temperature surely plays 
an important role and a correction perhaps can 
made for this, there are other parameters that 
seem to be important, not all of which are under- 
stood at this point. Moving the gratings between 
Th/Ar and science exposures may increase the cal- 
ibration error. In what follows we will not attempt 
to use the temperature trend to make any correc- 
tion, though in future work this might be possible. 

We note that other contributions to the cal- 
ibration shifts are possible. N. Suzuki (private 
communication) suggested that the cause could 
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Fig. 6. — Iodine recalibration shift w s hift vs 
ATempin for the PHL957 iodine exposures (la- 
beled asterisks), and a star HD209837 (small 
squares). The shift is the single best wavelength 
recalibration shift for echelle order 66 found by 
our iodine line fitting program. The quantity 
ATempin is the temperature inside the HIRES 
enclosure measured during the Th/Ar arc calibra- 
tion exposure minus the same temperature mea- 
sured during the data exposure. The solid trend 
line, Wshift = —2459 ATempin+853. is a fit to the 
points excluding QSO exposure 5-3. Exposure 5-3 
appears twice labeled by which Th/Ar calibration 
arc was used. 



be that the HIRES spectrograph is mounted at 
a small angle with respect to the optical axis, so 
that the light path rotates as the telescope moves, 
resulting in variable vignetting in comparison to 
light from the Th-Ar lamp which is fixed to the 
spectrograph. More generally, the optical path of 
the Th/Ar calibration light differs from the op- 
tical path of the QSO light resulting in different 
wavelength to pixel mapping. P. Molaro (private 
communication) suggested that the cause could be 
the changing position of the QSO image centroid 
in the slit. We hope to check this with additional 
exposures at a future date. M. Murphy (private 
communication) suggested that the cause could be 
temperature and pressure differences between the 
Th/Ar calibration exposures and the data expo- 
sures, as well as the resetting of the echelle grat- 
ings. The analysis presented above was, in large 
part, motivated by the comments of M. Murphy. 

It would be useful to study and understand 
these wavelength calibration shifts, perhaps by an- 
alyzing other data, since it might then be possible 
to model and remove them. This will be pursued 
in more detail elsewhere. 

Overall, the wavelength calibration errors re- 
ported here may seem to be a remarkable result, 
but we note that Osterbrock et al, (2000) recali- 
brated the Keck HIRES Th-Ar tube wavelengths 
using several night sky airglow lines and found 
similar (~ 0.05A) calibration errors. In addition, 
both the magnitude and time variation of these 
shifts were detected using Ly alpha forest lines by 
Suzuki, et al. (2003). 

It is interesting to ask whether sky lines them- 
selves could be used to calibrate the QSO spectra 
without use of the iodine cell. In fact, the cur- 
rent version of MAKEE data reduction pipeline 
includes an option to calibrate using sky lines. 
We used our iodine method to check on the wave- 
length accuracy of the MAKEE sky line calibra- 
tion, but still found substantial wavelength shifts 
both within and across Echelle orders, and as a 
function of time. 

2.4. Use of the Calibration 



Our original hope was that if there would be a 
time-stable wavelength calibration found from the 
iodine cell exposures then that then could be ap- 
plied to all the exposures. When the iodine cell 
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is in place we get well-calibrated wavelengths, but 
substantially less S/N. This is because iodine lines 
cover basically the entire spectrum, decreasing the 
number of photons, adding spurious components 
to line fitting programs, and also making the con- 
tinuum difficult to determine. So, it is the expo- 
sures without the iodine cell that carry the highest 
signal/noise and that we wish to use to test for a 
changing alpha. Since again, as Figure [5] shows, 
the wavelength calibrations vary substantially over 
the course of a night, it is not clear how to use the 
calibration in our fits. We tried several methods 
before settling on the following: 1) for exposures 
taken with the iodine cell we add the wavelength 
recalibration correction to each wavelength output 
by the standard HIRES pipeline. 2) For exposures 
taken without the iodine cell, we interpolate/ ex- 
trapolate the wavelength correction using the two 
iodine exposures nearest in time, and assuming 
that the correction changes linearly with time. 3) 
We then add the barycentric correction to every 
data point. 

At this point we have wavelength corrected 
spectra that can be fit. We can either add the 
spectra for each order together (rebinning since 
the spectra are no longer on a common wave- 
length scale) or just combine and sort the data 
files for each order together giving more flux mea- 
surements to be fit. 

We can check the effect of our recalibration 
by seeing how well sharp features in the spectra 
line up. For example, Figure [7] shows a close- 
up of the left edge of the saturated Fe II A 1608 
line before and after iodine recalibration. The 
panel labeled 'Th/Ar' (a) shows an overlay of the 
six iodine exposures calibrated with the standard 
HIRES pipeline, while the panel labeled 'Iodine' 
(a) show the same after our recalibration. The 
features line up significantly better after recali- 
bration; spreads of more than 2000 m/s become 
significantly less than 1000 m/s. 

The other two panels show overlays of the seven 
exposures taken without the iodine cell. The panel 
labeled 'Th/Ar' (b) is again the result from the 
standard HIRES calibration, while the panel la- 
beled 'Iodine interp' (b) shows the same seven ex- 
posures, but recalibrated using the interpolation 
scheme described above. For the non-iodine expo- 
sures, there is some improvement in alignment of 
the line edge, but not nearly as much as for the io- 




Av shift (km/s) 



Fig. 7. — Close-ups of the left edge of the Fe 
II A1608 line for observations taken on October 
3 (black), October 4 (blue), and October 5 (red). 
The panels labeled 'Th/Ar' show results from the 
standard Th/Ar wavelength calibration, while the 
panels labeled 'Iodine', or 'Iodine interp' show re- 
sults after recalibration with the iodine cell. The 
two panels labeled (a) are for the 6 exposures with 
the iodine cell in place, while the two panels la- 
beled (b) are for the 7 exposures taken without 
the iodine cell, but recalibrated either using the 
iodine lines (upper panel) or using interpolation 
as described in the text (lower panel). The ex- 
tra bumps in the (a) panels are iodine lines, and 
the first part of the spectrum in exposure 5-0 is 
missing due to a cosmic ray. The iodine lines do 
not line up because of the barycentric correction 
needed for PHL957. 



9 



dine exposures. Here spreads of 1000 m/s or more 
seem to remain even after recalibration. 

Since for signal/noise and fitting reasons, we 
must use mainly the non-iodine exposures, it is 
disappointing that our efforts to recalibrate may 
not pay off. Our inability to model the calibration 
shifts between iodine exposures seems to be the 
main culprit. 

3. Fitting the Data 

In what follows, we mostly just combine the re- 
calibrated, continuum subtracted spectra for all 
non-iodine exposures into one large file, and then 
sort by wavelength and fit. In some cases we re- 
bin the data combining several data points and 
adding the errors in quadrature, and in some cases 
we just use the co-added spectra from the standard 
HIRES pipeline. We treat the data from 2002 sep- 
arately since it was taken on a different CCD chip. 
We also treat the data with and without the iodine 
cell separately since the iodine lines add significant 
effective noise to the PHL957 spectra. 

3.1. Voigt profile fitting 

We fit the spectra using a code we developed 
based upon the CERN library MINUIT minimiza- 
tion program and the humdev Voigt profile calcu- 
lation routine (Wells, 1999). We compared our re- 
sults with those of VPFIT (Carswell et al. 2008) 
and DUDE (Kirkman et al. 2003) and found 
agreement for individual lines. Using our own code 
allows us easily to do joint fits with the redshifts 
of transitions varying independently and also al- 
lows us to add additional parameters as needed. 
Figures [8] and [9] show fit results for several transi- 
tions for the combined 2004 data. The fit parame- 
ters are the redshift, the line width (b- value), and 
the column density. We note that several of the 
lines are saturated, making wavelength centroid- 
ing more difficult and less accurate. 

3.2. Limit on precision of ^ 

The S/N of each 2004 non- iodine spectra is ap- 
proximately 25 per pixel, giving a total S/N for the 
7 non-iodine exposures of about 70 per pixel. The 
individual iodine exposures have a S /N of around 
19 per pixel, for a co-added total of about 47 per 
pixel, though the iodine lines cause the effective 
S/N to be lower than this. The co-added 2002 data 
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Fig. 8. — Voigt profile fit results for several 
PHL957 lines using our fitting code. The velocity 
components are marked. These are unconstrained 
fits for each line so the velocity components of var- 
ious transitions are not forced to agree with each 
other in these plots. 
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Fig. 9. — Voigt profile fit results for several 
PHL957 lines using our fitting code. The veloc- 
ity components are marked. 



have S/N approximately 42 per pixel. Our total 
signal/noise is thus quite high for a high redshift 
object, and we want to first estimate the minimum 
possible error on — that could be obtained with 
these spectra. 

To do this, we use a Fisher matrix type method 
suggested by Murphy, Webb, and Flambaum 
(2006), and Bouchy, Pepe, and Queloz (2001). For 
continuum normalized flux spectrum F(Xi) with 
1-sigma error array <jf (Xi), the minimum possible 
uncertainty in velocity contributed by pixel i is: 



&v(Xi) 



(Tf(Xi) 



Xi(dF(\i)/d\i) 



(2) 



Thus more precise velocity measurements come 
from pixels with large flux gradients and small 
errors. The minimal possible uncertainty in the 
velocity of a portion of spectrum is thus: 



a v = [Sjl/a^A,)] 2 ] 



21-V2 



(3) 



where the sum is over pixels. Finally, the mini- 
mum uncertainty in ^ can be found by perform- 
ing a least-squares fit to a version of Equation [T] 



v + 



Aa 



-2cqjX, 



3 A 0j, 



(4) 

where j numbers the lines that are being com- 
pared, «o is a constant offset (degenerate with the 
system redshift), and the minimum error in — is 
just the fit uncertainty in the slope of this linear 
equation. 

Murphy, Webb, and Flambaum (2006) per- 
formed this analysis for their data and for the 
data of Chand, et al. (2004), and Levshakov, 
et al. (2006), finding that while their own errors 
were (barely) within the allowable minimum, the 
reported uncertainties of the others were smaller 
than the minimum possible. The corrected version 
of the Levshakov results (Molaro, et al. 2008) does 
seem to be in agreement with the minimum error 
limit. 

We would like to perform such an estimate, but 
first note that previous estimates of minimum er- 
rors did not include uncertainties in the q values. 
Table [2] shows that these uncertainties can be sig- 
nificant, and inclusion of these uncertainties will 
increase both the Fisher matrix minimum errors 
and also the error on — . One can include these 
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uncertainties, <r q (j), in the fit and obtain a more 
realistic minimum error estimate. In this case, 
rather than a simple linear least-squares fit one 
must use a method that allows errors in both the 
ordinate and the abscissa. Since the uncertainties 
in q are theoretical estimates and not statistical 
errors, this method is not technically completely 
consistent, but it should give a reasonable idea of 
the size of the effect. As we find below, the small- 
ness of the claimed values for ^ imply that the 
uncertainties in q- values are not very important. 

Tabic [2] shows the minimum values of a v for 
the portion of the combined 2004 spectra contain- 
ing each of the transitions that have a calculated 
q- value. These results were calculated using co- 
added spectra from the standard XIDL HIRES 
pipeline that did not include the iodine cell wave- 
length recalibration; this simplifies the calculation 
but has little effect on these minimum error re- 
sults. 

The values range from 25 m/s for the strong 
Fe II A1608 line to 153 m/s for the weak lines 
such as Fe II A1611. It is interesting to note 
that the maximum velocity precisions obtainable 
with our data seem to be sufficient to measure the 
shifts predicted by a changing value of — at the 
level claimed by Murphy et al. (2003). However, 
we also see that the weakness of the Fe II A1611 
line means that the precision obtained from using 
only the Fe II A1608/A1611 pair may not be good 
enough for this purpose. 

One can combine all these minimum a v values 
using Equation[3]to get an overall minimum veloc- 
ity error, but since — is determined by differences 
in redshifts this is not appropriate. If one just 
wanted to determine how accurately the redshift 
of the entire system could be determined, then one 
could combine the minimum errors in Table [2] to 
find a v (min) (all) = 13 m/s. If we included only 
the lines for which we have iodine cell calibration 
the result is cr v (min) (calib) = 15 m/s. The 2002 
data alone would give 38 m/s precision, which if 
combined with the 2004 data would give an ulti- 
mate velocity precision of 12 m/s. Of course, as 
discussed below there are several important sys- 
tematic errors that greatly increase these uncer- 
tainties. 

Next we perform the least squares fit to find 
minimum possible errors on — from this data and 
display the results in Table [3l These uncertainties, 



e.g. <r(— ) > 1.2 x 10 6 , are reasonably competi- 
tive with errors quoted by Murphy, et al. (2001a; 
2003; 2004), with Chand, et al. (2004), and with 
Levshakov, et al (2006). We again note that the 
latter two groups seem to have produced measure- 
ments of — with errors smaller than their min- 

a 

imum possible errors, something Murphy, Webb, 
and Flambaum (2006) have criticized, and which 
Levshakov et al. subsequently corrected (Molaro, 
et al. 2008). 

We note that minimum possible error coming 
from analysis of the Fe II A1808/A1611 pair is 
c(— ) > 6.2 x 10~ 6 , substantially less precise than 
could be found using several lines. As mentioned 
this is mostly because the Fe II A1611 line is so 
weak (due to its rather low oscillator strength) 
and thus its redshift cannot be measured very pre- 
cisely. Therefore, our original idea of using just 
these two lines is probably not that useful. In ad- 
dition, examination of the lines shows that it is 
only the saturated regions of the Fe II A1608 line 
profile that are detectable in the Fe II A1611 line 
profile. Thus a joint \ 2 fit cannot accurately re- 
cover their relative velocity offset. It seems it is 
better to compare strong lines with other strong 
lines, and weak lines with other weak lines. We do 
this in the following section. 

We can also test the importance of the uncer- 
tainties in q listed in Table [2j by doing a linear 
fit that allows errors in both the abscissa and or- 
dinate. In this case, the resulting minimum error 
depends upon the fit value of — , that is, the slope 
of the line given in Equation [4] This is to be ex- 
pected since if — = 0, it does not matter what the 
values of x are, while if there is a large slope then 
uncertainty in x will propagate to uncertainty in 
v, and therefore uncertainty in — . A simple way 
to estimate the increase in — uncertainty is to 
just do this propagation of errors, that is, change: 

^ * (5) 

where a x = —2qc\ocr q . The results of this error 
propagation are also shown in Table[3]for the Mur- 
phy, et al. (2003) value = -5.4 x 10~ 6 . We 
note that because the slope — is small, the in- 
crease in uncertainty is also quite small, no larger 
than around 14% depending on the set of lines 
used. Thus previous workers who ignored this 
effect arc not making a large underestimation of 
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their — uncertainties and limits. Because of the 
smallness of this effect, we will not consider the 
uncertainty in q in what follows. 

It is important to realize that the minimum pos- 
sible errors discussed above are never reached in a 
real observation for several reasons as follows: 

1. As extensively discussed in Murphy, et al. 
(2008, section 2.2.2), comparison between transi- 
tions with different degrees of saturation will give 
velocity uncertainties derived from Voigt profile 
fitting substantially greater than the Fisher ma- 
trix minimum. This is because velocity precision 
coming from the sharp edges of the saturated pro- 
files cannot be directly compared with the central 
regions of unsaturated profiles. Especially in mul- 
ticomponent systems, this will result in degener- 
acy between the component fit parameters, and 
make the redshift determination of any one com- 
ponent less accurate. This is evident when the co- 
variance matrix for one of our 5 component fits is 
examined. We see strong correlation between the 
redshifts of the various components and between 
these redshifts and various other parameters such 
as the b- values. 

We can investigate this effect directly in our 
data by comparing the formal parameter fit errors 
with the minimum errors calculated above. For 
example, a 5 component fit of the saturated Fe II 
A 1608 line using the co-added data gives a mini- 
mum possible redshift error of 33.4 m/s, while the 
fitting program returns an error of 184 m/s on the 
strongest component. 

For Si II A1808, an unsaturated line, the effect 
is smaller, but still pronounced: 32.6 m/s mini- 
mum error vs. 55 m/s fit result. We note that the 
minimum errors found from using formulas [2] and 
[3] on the individual exposures are quite consistent 
with the minimum errors found from the co-added 
data: For 3-componcnt fit to Fe II 1A1608 we find 
32.6 m/s for co-added data, vs. 27.4 m/s from 
combining all 7 non-iodine cell exposures. For Si 
II A1808 the comparison is 50.4 m/s from the co- 
added compared to 49.9 m/s for individual expo- 
sures. Note that errors returned from the fit soft- 
ware are actually also overly optimistic because: 

2. as discussed above there are other sources of 
error such as wavelength calibration, 

3. the errors on the flux are not really Gaussian 
which means that larger than Gaussian deviations 



occur, 

4. it is very difficult to determine the number 
of components needed to fit each transition, espe- 
cially for saturated, or nearly saturated lines. The 
formal fit error returned on a component redshift 
does not include the possibility of another compo- 
nent, which may be apparent in some transitions 
and not in others. The \ 2 surface can have sev- 
eral local minima, and small measurement errors 
can switch one between needing another compo- 
nent and not needing another component. Since 
it is the difference between component redshifts 
that is used to determine — , this can be a source 
of systematic error. For example, we found ex- 
amples where a region of a spectrum was formally 
slighter better fit by two nearby components than 
by a single component, but the redshift of the com- 
ponents in the two cases shifted by around 1500 
m/s, a value large compared to what is expected 
from ^s. Murphy et al. (2008) investigated this 
effect in detail (see their Figure 7) and found that 
while this effect does exist if the the number of fit 
components is less than the true number of com- 
ponents, this systematic error is greatly reduced if 
the number of fit components is equal to or greater 
than the true number of components. 

3.3. Fitting for — : simple method 

We tried several methods of deriving a value 
of — from our Voigt profile fits. The simplest 
method is to fit each of our N lines independently, 
and then do a least squares linear fit of Equation[U 
The value of Vj can be found from 

Vj/C = (z avo - Zj)/(1 + Zavc), (6) 

where z avc is the redshift obtained by averaging 
over all the transitions detected in a given veloc- 
ity component. The result of this simple direct 
method for component 1 (the component with the 
largest column density) of the 9 lines for which 
we have both q-values and an iodine spectrum is 
shown as the solid line in Figure [TO] Here we plot 
Xj = —2qjc\oj vs. the wavelength corrected ve- 
locity u cor for each line, as well as the best linear 
fit for the 9 lines (solid line) and the best linear fit 
for the 5 unsaturated, lines that are singly ionized 
(dashed line). For component 1 of the 9 lines, the 
fit slope is ^(component 1) = (186 ±6.3) x 10~ 6 , 
with x 2 /d.o.f = 264, formally a very strong detec- 
tion, inconsistent with other measurements. For 
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component 1 and the 5 lines, the fit slope is 
^(component 1) = (-4.5 ± 9.3) x 1CT 6 , with 
X 2 /d.o.f = 1.09, formally a nice null result con- 
sistent with, but much weaker than results in the 
literature, and a factor of 3 worse than the mini- 
mum possible error listed in Table [3l 

However, we do not believe either of these re- 
sults. If a changing fine structure were the ex- 
planation for the variation in values of u cor in 
the 9 line fit, then the points in Figure ITD1 would 
lie on a straight line within errors. We see from 
the figure and from the terrible x 2 /d.o.f that this 
is not the case. When the goodness of fit to a 
model is very bad, it means that either the model 
is incorrect or that there are systematic errors 
not included in the data uncertainties. In such 
cases the errors derived from the fit cannot be 
trusted, and a better way to estimate the uncer- 
tainty is to increase the errors on the data points 
until x 2 /d-0.f ~ 1. Doing this for the 9 line 
fit gives ^(component 1) = (186 ± 102) x 10~ 6 
(x 2 /d.o.f = 1.01), a null result with a more realis- 
tic error estimate, but far from the sensitivity we 
expected from our data. 

Another way to see that the first result above 
is meaningless is to do the fit including the errors 
in q. This gives an entirely different answer which 
also is a terrible fit. However, when the errors are 
scaled as above, this fit then gives nearly the same 
result as above but with larger errors. 

The problem with the 9 line fit seems to be the 
saturated lines Fc II A1608 and Al II A1670, as 
well as the aluminum lines Al III A1854 and A1862. 
Since Al III is in a different ionization state than 
the other ions, it might exist at a different physical 
location, and so maybe should be left out of the 
fits. We want to discuss in more detail the satu- 
rated lines that are far from the others, but before 
doing that we note that this pattern repeats also in 
the x vs. u CO r fits of the 2nd strongest component, 
the results of which are given in Table 2] 

The fit redshifts of the first components of the 
Fe II A1608 and Al II A1670 lines differ from 
the dashed 5 line fit by about Az ~ 5 x 10 -5 . 
Some insight into possible reasons for this can 
be gained by noting that if we force this com- 
ponent of Fe II A1608 to be the same as the av- 
erage of the unsaturated transitions, we also get 
a completely satisfactory fit to the Voigt profile: 
X 2 /d.o.f = 1487.7/1495 = .995 for the free fit, 
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Fig. 10. — Linear fit simple method to find ^ for 
the strongest component of 9 calibratable lines and 
non-iodine spectra. The solid line is the best fit for 
all 9 lines, while the dashed line is the best fit for 
the 5 singly ionized unsaturated lines. The slope 
of the 9 line fit is = (186 ±6.3) x 10~ 6 , and the 
slope of the 5 line fit is ^ = (-4.5 ± 9.3) x 10~ 6 . 
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and xVd.o.f = 1503.4/1496 = 1.004 (Q = 0.43) 
for the 3 component fit with one redshift forced 
to the average value. The reported formal fit er- 
rors on these redshifts are a z = 4.4 x 10~ 7 , sub- 
stantially smaller than the difference between the 
free fit and the forced fit. Thus we see another 
possible source of error in determining — . The 
formal fit errors seem to underestimate the true 
range of acceptable redshift values, and therefore 
overestimate the precision with which ^ can be 
determined by this method. We note that when 
we force the redshift of the first component of Fe II 
A1608 to agree with the other lines, the 2nd and 
3rd component also come into alignment. Thus 
there is a degeneracy where the redshift and b- 
value of one component can play off the redshift 
and b-values of other components. Another way 
this can be seen, is from the fact that the value 
derived for — depends greatly on the error re- 
ported for the Fe II A1608 redshift. Figure [TO] 
shows that the linear fit for — is driven by the 
tiny error on the Fe II A 1608 redshift. Increasing 
that error by a factor of ten (to roughly equal the 
errors on the other redshifts) changes the value to 
4f = (-9.3 ±9.0) x 10~ 6 . 

Examination of the Al II A 1670 line shows a 
similar problem; again the line is saturated and 
in this case there are certainly more than 3 de- 
tectable components. We are only allowing 3 total 
components so that we can compare component 
to component across other transitions where only 
3 components are detectable, but the fitting pro- 
gram can find different nearly equivalent ways to 
fit the line to 3 components since saturation means 
less shape information is available. 

While the effect is largest for the saturated 
lines, we find the same effect for the Al III A 1864 
and Al II A1862, unsaturated lines, where there are 
only 3 detectable components. Here again while 
the best fit redshift is far from the others, we can 
get a completely acceptable fit to the Voigt profile 
(% 2 /d.o.f < 1) even while forcing the first compo- 
nent to the average. Again, when one component 
is forced to agree, the other components then also 
agree. 

Thus, we note a systematic error that can give 
trouble in this type of fitting. When a system 
has two (or more) components nearby in redshift 
space, there can be a near degeneracy where the 
two redshifts can play off each other, moving sig- 



nificantly at the level of precision we are after, but 
yet still giving very good Voigt profile fits. 

Note that a linear fit to the 5 line set that in- 
cludes the q errors gives nearly the same answer 
as shown by the dashed line in Figure [TDJ This is 
to be expected since the slope of the line, ^p, is 
small. 

We can repeat the above analysis for the 
2nd and 3rd strongest components. The results 
are given in Table [4] for various sets of lines. 
In these cases we mostly find null results such 
as ^(component 2) = (14 ± 12) x 10~ 6 with 
X 2 /d.o.f = 92 for all nine lines. This is because 
the lower column densities mean larger formal fit 
errors. 

There are additional components detectable in 
some of the lines, but they are much weaker and do 
not contribute much. To get a final answer using 
this method one could average the above results, 
which actually gives a result with larger formal 
error than from the first component alone, again 
showing that systematic errors are dominating this 
procedure. 

3.4. Fitting for — : many parameter joint 
fit 

The above method, while useful for showing er- 
rors arising from fitting degeneracies, is not really 
correct because for it to work one must match up 
the various components fit in different lines in or- 
der to properly compare their redshifts. As noted, 
these fitting degeneracies allow the redshifts of 
two nearby components to be traded against each 
other and against the b-values, resulting in cor- 
relations that cause the uncertainty in the com- 
ponent redshifts to be larger than the formal fit 
errors. 

More properly, one should do a large joint fit for 
all system components simultaneously, that is tie 
together the different velocity components. This 
is standard practice and is used by Murphy et al., 
Chand, et al. Levshakov, et al., etc. Thus, for 
example, the fit to the 9 lines above by the pre- 
vious method used a total of 81 parameters: red- 
shift, column density and b- value for each of 3 
components of each of 9 lines. A global joint fit 
might might have 9 parameters for the redshift, 
column density, and b-value of each of 3 compo- 
nents, plus some column density offsets to allow 
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for different elemental abundances (1 parameter 
per component per additional element or 12 in- 
cluding Fe II, Ni II, Si II, and Al II and Al III). 
An additional b- value offset is probably necessary 
for each component of the Al III lines since these 
are in a different ionization state (3 more parame- 
ters). Then adding — as a final parameter, there 
would be a total of 25 parameters. 

The results of such a global joint fit method are 
shown in Tabled] for various sets of lines. Tabled 
shows that results we find for — vary over a wide 
range of values, from significant detection — = 
(-26 ± 8) x 1(T 6 for the Fe II A1608/Fe II A1611 
pair, to to a null result = (-0.07 ± 3.9) x 
1CP 6 for the nine-line fit. Looking over all the 
results in Table [4] we see that both the method 
used and the lines selected can make a significant 
difference in the final result. If our errors were 
under control, this should not be the case. Note 
that in most cases, the errors for the global fit 
method method hover around 4 x 10 -6 ; roughly 
half the errors of around 8 x 10~ 6 found in the 
simple method described in the previous section. 
In Table[5]we display the Voigt profile components 
that resulted from one of our joint global fits. 

3.5. Fitting for : VPFIT global joint fit 

One might worry that the puzzling results we 
find come from errors in our fitting program, so as 
a final check, we redid our calculation of — using 
the new version of VPFIT (Carswell, et al. 2008) 
which includes the possibility of doing a global 
joint fit for — . These results are also shown in 
Table H FoAhe 6 line set (Fe II A1611/A1608, 
Ni IIA1709/A1741/A1751, Si IIA1808) we stud- 
ied above we find ^ = (-9.1 ± 7.7) x 1CT 6 
(x 2 /d.o.f = 3.1, Q ="l(r 139 ), while for the set 
of all 16 lines we find find = (-17± 10) x lfr 6 
(x 2 /d.o.f = 6.0, Q — 0). Both these results are 
within the range found by our different fitting 
methods. 

4. Discussion 

We tried and failed to give a definitive answer to 
the question of whether the fine structure constant 
was different at early times in high rcdshift Ly al- 
pha systems. In order to investigate this problem 
in detail we used data taken through the Keck 
iodine cell and wrote our fitting software from 



scratch. Using the iodine cell for wavelength cal- 
ibration we found a serious source of systematic 
error that did not allow calibration of the Keck 
HIRES spectrograph to the precision needed. We 
also found degeneracies in the fitting procedures 
that added to the calibration systematic errors. 

Due to all the systematic errors we were able 
to derive various results, running from very signif- 
icant detections, to strong null limits. Does this 
imply that a meaningful measurement of or limit 
on ^£ is impossible using Keck HIRES? It is not 
clear. Perhaps more careful attention to Voigt pro- 
file fitting, as advocated by M. Murphy (private 
communication 2008) will solve the fit degeneracy 
problem. Perhaps a careful selection of absorbers 
would also help, or trying to focus on systems with 
a single component. Perhaps the wavelength cal- 
ibration errors we discovered can be corrected, or 
perhaps they will average away if a large sample 
of absorbers is considered. In this paper we raise 
these questions, but do not answer them. 

We also used a Fisher matrix technique to in- 
vestigate the minimum possible errors in — that 
our spectra's signal/noise would allow. We found 
our fit results did not exceed these limits. For ex- 
ample, for the set of 9 lines used in most of our 
analyses, the minimum possible error on ^p, as 
given in Tabled is 2.52 x 10~ 6 , a result consistent 
with our fit results. Since the calculation of min- 
imum possible errors is not difficult, we suggest 
that workers always calculate them and never re- 
port results with uncertainties smaller than the 
data theoretically can allow. 

While we have not yet looked carefully at data 
or analysis done by other workers in the field, we 
worry that some of the systematic errors and over- 
estimation of precision we found here may also be 
present in other analyses. Thus one possible expla- 
nation for the discrepant findings on ^ discussed 
in the introduction, is that several workers in the 
field are overestimating the precision of their mea- 
surements and the discrepancies reported in the 
literature are due to random fluctuations occur- 
ring within the larger, under-reported, systematic 
errors. 

At this point it is not clear how to make further 
progress in this subject using Keck HIRES, but 
other techniques such as frequency combs (Stein- 
metz, et al. 2008) may become available and be 
of use in resolving the question. In addition, pro- 
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posed new instruments (e.g., CODEX for E-ELT 
or ESPRESSO for the VLT) are being designed for 
Doppler measurement stability and will hopefully 
be be free of these problems. 
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Table 1 

Journal of PHL957 Observations 



Date 


ID 


Iodine Cell? 


Time (UT) 


Exposure (s) 


Tcmpin" (°C) 


Arc ID 


Arc Time (UT) 


Moved? b 


A Tempin (°C) 


1 nov 02 


33 


out 


5:26 


1800 


3.75 










1 nov 02 


34 


out 


5:58 


1800 


3.76 










1 nov 02 


35 


in 


6:30 


1800 


3.79 










1 nov 02 


36 


in 


7:01 


1800 


3.76 










1 nov 02 


47 




7:58 


1800 


3.61 










1 nov 02 


48 


out 


8:30 


1800 


3.68 










1 nov 02 


49 


in 


9:03 


1800 


3.68 










1 nov 02 


50 


in 


9:34 


1800 


3.78 










1 nov 02 


60 


out 


10:34 


1800 


3.71 










1 nov 02 


61 


out 


11:06 


1800 


3.75 










1 nov 02 


62 


in 


11:39 


1800 


3.71 










3 oct 04 


67/3-0 


out 


9:18 


3600 


4.26 


66 


09:15 


no 


0.042 


3 oct 04 


68/3-1 


in 


10:31 


3600 


4.21 


66 


09:15 


no 


0.097 


3 oct 04 


69/3-2 


in 


11:33 


3600 


4.07 


66 


09:15 


no 


0.236 


3 oct 04 


70/3-3 


out 


12:35 


3600 


4.00 


66 


09:15 


no 


.0306 


4 oct 04 


1096/4-0 


out 


9:25 


3600 


3.00 


1144 


15:35 


yes 


-0.153 


4 oct 04 


1098/4-1 


in 


10:56 


3600 


2.97 


1144 


16:35 


yes 


-0.125 


4 oct 04 


1099/4-2 


in 


11:57 


3600 


3.01 


1144 


16:35 


yes 


-0.167 


4 oct 04 


1100/4-3 


out 


12:58 


3600 


2.93 


1144 


16:35 


yes 


-0.083 


5 oct 04 


2094/5-0 


in 


8:25 


3600 


2.87 


2026 


3:01 


yes 


0.125 


5 oct 04 


2095/5-1 


out 


9:27 


3600 


2.86 


2107 


15:46 


yes 


0.430 


5 oct 04 


2096/5-2 


out 


10:29 


3600 


2.91 


2107 


15:46 


yes 


0.375 


5 oct 04 


2097/5-3 


in 


11:30 


3600 


3.62 


2107* 


15:46 


yes 


-0.333 


5 oct 04 


2098/5-4 


out 


12:32 


2700 


3.55 


2107 


15:46 


yes 


-0.264 



a Tcmpcrature inside the the HIRES enclosure. 

b Whcthcr or not the grating was moved between Th/Ar arc and data exposures 
*This exposure was also calibrated with Th Ar arc 2026; see text 



Table 2 
Line Information 



Transition 


Echcllc order 


q value (cm x ) a 


min a v (m/s) (2004 data) 


Iodine cell coverage? 


Fe II A1608.45 


67 


-1030 ± 300* 


25.0 


yes 


Fe II A1611.20 


67 


1560 ± 500* 


153 


yes 


Al II A1670.79 


65 


270±?* 


34.0 


yes 


Ni II A1709.60 


63 


-20 ± 250** 


83.1 


yes 


Ni II A1741.55 


62 


-1400 ± 250** 


48.7 


yes 


Ni II A1751.92 


62 


-700 ± 250** 


70.8 


yes 


Si II A1808.01 


60 


520 ± 30** 


36.4 


yes 


Al III A1854.72 


58 


458 ± 2*** 


76.0 


yes 


Al III A1862.79 


58 


224 ± 1*** 


125 


yes 


Si II A1526.71 


71 


50 ± 30** 


28.8 


no 


Zn II A2026.14 


53 


2488 ± 25 f 


129 


no 


Zn II A2062.66 


52 


1585 ± 25 + 


229 


no 


Cr II A2056.26 


52 


-1030 ± 150** 


89.9 


no 


Cr II A2062.24 


52 


-1168 ± 150** 


102 


no 


Cr II A2066.16 


52 


-1360 ± 150** 


143 


no 


Fe II A2344.21 


46 


1540 ± 400* 


41.7 


no 



a q-values marked * are from Dzuba, ct al (2001); marked ** from Porsev, et al (2007); marked * from Murphy, 
ct al. 2001a; marked * from Savukov & Dzuba (2007); marked *** from Dzuba &c Flambaum (2008). 
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Table 3 

Minimum possible errors in 4r 



Line set 


-^p(min) (no error in q) 


^^-(min) (including error in q) 


■^p(min) (no error in q; w/o 12 exposures) 


All 16 lines (2002 and 2004 data) 


1.17 x 10~ 6 


1.32 X 10~ 6 




All 16 lines 


1.21 x lO" 6 


1.36 x lO" 6 


1.50 X 10~ 6 


9 calibratablc lines 


2.00 x 10- 6 


2.14 x 10~ 6 


2.52 X 10~ 6 


7 calibratable, unsaturated lines 


2.69 x 10" 6 


2.77 x 10~ 6 


3.28 x 10" 6 


Fe 11 A1608/A1611 pair 


6.18 x 10~ 6 


6.31 x 10~ 6 


6.20 x 10" 6 



Note. — Values arc for 2004 data only and include exposures with and without the iodine cell in place, except where noted. 



Table 4 
Fit results for — 

a 



Method 


Line set 




X 2 /d.o.i 


probability (Q) 


Simple (comp 1) 


the (5) 


-4.5 ± 9.3 


1.09 


0.35 


Simple (comp 1) 


(5)+Fc 11 A1608 


213 ± 7.1 


323 


10 -27S 


Simple (comp 1) 


(5)+Al III A1854/A1862 


6.7 ± 9.2 


22 


2 x 10~ 22 


Simple (comp 1) 


(5)+Fe II A1608 +A1 III A1854/A1862 


237 ± 6.8 


248 


10 -318 


Simple (comp 1) 


the (9) 


186 ± 6.3 


264 





Simple (comp 2) 


the (5) 


-7.1 ± 12 


0.66 


0.57 


Simple (comp 2) 


(5)+Fc II A1608 


15 ± 12 


75 


1Q -63 


Simple (comp 2) 


the (9) 


14 ± 12 


92 


1Q -136 


Simple (comp 3) 


(5)+Fc II A1608 


25 ± 51 


3.8 


0.004 


global joint fit 


Fe II A1608 and Fe II A1611 pair 


-26 ± 8.1 


0.96 




global joint fit 


the (5) 


-3.3 ± 4.3 


0.80 




global joint fit 


(5) + Fe II A1608 


-18 ± 3.8 


0.92 




global joint fit 


(5) + Al III A1854/A1862 


-6.3 ± 4.3 


0.83 




global joint fit 


the (9) 


-0.70 ± 3.9 


1.01 




VPFIT global (w/o iodine) 


(5) + Fe II A1608 


-9.1 ± 7.7 


3.1 


10 -iaa 


VPFIT global (w/o iodine) 


all 16 


-17 ± 10 


6.0 






NOTE.— The five (5) lines included in almost all the fits are Fe II A1611, Ni II A1709/A1741/A1751, and Si II A1808. 
The nine (9) lines are the (5) plus Fe II A1608, Al III A1854/A1862 and Al II A1670. The VPFIT spectra were handled 
differently and did not include the iodine wavelength correction, so VPFIT results are not expected to be the same as 
our joint global fit results. 



Table 5 

Component fit values for a 3 component joint global fit of 9 lines. 



Component 


redshift z 


velocity width b (km/s) 


log column density log(iV) 


Comp 1 


2.3090291 ± 9.58 x 10~ 7 


6.41 ± 0.05 


14.76 ± 0.029 


Comp 2 


2.3091517 ± 1.68 x lO" 6 


7.75 ± 0.12 


14.563 ± 0.035 


Comp 3 


2.3094793 ± 9.49 x 10~ 7 


16.44 ± 0.11 


14.039 ± 0.021 



NOTE. — The 9 lines included in the fit are the ones listed in the Introduction. Each component of 
each element has an additional fit parameter (not listed) which is a column density offset to allow for 
different elemental abundances. This fit resulted in a fit value of = (—0.070 ± 3.9) X 10~ 6 and 
X 2 /d.o.f = 14138/13891 = 1.01. 
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